A method for identifying splice sites and translation start sites in human genomic sequences.

نویسندگان

  • Ki-Bong Kim
  • Kiejung Park
  • Eun Bae Kong
چکیده

We describe a new method for identifying the sequences that signal the start of translation, and the boundaries between exons and introns (donor and acceptor sites) in human mRNA. According to the mandatory keyword, ORGANISM, and feature key, CDS, a large set of standard data for each signal site was extracted from the ASCII flat file, gbpri.seq, in the GenBank release 108.0. This was used to generate the scoring matrices, which summarize the sequence information for each signal site. The scoring matrices take into account the independent nucleotide frequencies between adjacent bases in each position within the signal site regions, and the relative weight on each nucleotide in proportion to their probabilities in the known signal sites. Using a scoring scheme that is based on the nucleotide scoring matrices, the method has great sensitivity and specificity when used to locate signals in uncharacterized human genomic DNA. These matrices are especially effective at distinguishing true and false sites.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A method for identifying splice sites and translational start sites in eukaryotic mRNA

This paper describes a new method for determining the consensus sequences that signal the start of translation and the boundaries between exons and introns (donor and acceptor sites) in eukaryotic mRNA. The method takes into account the dependencies between adjacent bases, in contrast to the usual technique of considering each position independently. When coupled with a dynamic program to compu...

متن کامل

Exon amplification: a strategy to isolate mammalian genes based on RNA splicing.

We have developed a method, exon amplification, for fast and efficient isolation of coding sequences from complex mammalian genomic DNA. This method is based on the selection of RNA sequences, exons, which are flanked by functional 5' and 3' splice sites. Fragments of cloned genomic DNA are inserted into an intron, which is flanked by 5' and 3' splice sites of the human immunodeficiency virus 1...

متن کامل

Identification of a Novel Splice Site Mutation in RUNX2 Gene in a Family with Rare Autosomal Dominant Cleidocranial Dysplasia

Introduction: Pathogenic variants of RUNX2, a gene that encodes an osteoblast-specific transcription factor, have been shown as the cause of CCD, which is a rare hereditary skeletal and dental disorder with dominant mode of inheritance and a broad range of clinical variability. Due to the relative lack of clinical complications resulting in CCD, the medical diagnosis of this disorder is challen...

متن کامل

FunSiP: a modular and extensible classifier for the prediction of functional sites in DNA

MOTIVATION Many problems in genome annotation are tackled by using a classification model to predict functional sites such as splice sites, translation start sites or stop codons. Locating the correct position of these sites remains one of the most important but also one of the most difficult issues in the structural annotation of genomes. Most of the software currently in use is written for a ...

متن کامل

Identifying Potential Regulatory Sequences of Alternative Splicing

Alternative splicing is an important mechanism that contributes to expanding protein diversity by generating multiple protein isoforms from a single gene. We have previously reported computational approach to infer alternative splicing patterns from Mus musculus full-length cDNA clones and microarray data [4]. Although we have predicted a large number of unreported splice variants, general mech...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of biochemistry and molecular biology

دوره 35 5  شماره 

صفحات  -

تاریخ انتشار 2002